Finding the Homology of Submanifolds with High Confidence from Random Samples
نویسندگان
چکیده
Recently there has been a lot of interest in geometrically motivated approaches to data analysis in high dimensional spaces. We consider the case where data is drawn from sampling a probability distribution that has support on or near a submanifold of Euclidean space. We show how to “learn” the homology of the submanifold with high confidence. We discuss an algorithm to do this and provide learning-theoretic complexity bounds. Our bounds are obtained in terms of a condition number that limits the curvature and nearness to self-intersection of the submanifold. We are also able to treat the situation where the data is “noisy” and lies near rather than on the submanifold in question.
منابع مشابه
Identification of Human Chromosome Segments that Have High Homology with Rat Genomic DNA
This study was conducted to determine the location of DNA segment with homology to the rat conserved genomic DNA in human chromosomes. The labeled rat genomic DNA was hybridized with normal human (male) metaphases. The study of 74 metaphases after fluorescence in situ hybridization showed 371 twin-spot signals on human chromosomes. Statistical analysis indicated that the specific accumulation o...
متن کاملReconstructing Functions from Random Samples
From a sufficiently large point sample lying on a compact Riemannian submanifold of Euclidean space, one can construct a simplicial complex which is homotopy-equivalent to that manifold with high confidence. We describe a corresponding result for a Lipschitz-continuous function between two such manifolds. That is, we outline the construction of a simplicial map which recovers the induced maps o...
متن کاملSampling from social networks’s graph based on topological properties and bee colony algorithm
In recent years, the sampling problem in massive graphs of social networks has attracted much attention for fast analyzing a small and good sample instead of a huge network. Many algorithms have been proposed for sampling of social network’ graph. The purpose of these algorithms is to create a sample that is approximately similar to the original network’s graph in terms of properties such as de...
متن کاملDistribution Free Confidence Intervals for Quantiles Based on Extreme Order Statistics in a Multi-Sampling Plan
Extended Abstract. Let Xi1 ,..., Xini ,i=1,2,3,....,k be independent random samples from distribution $F^{alpha_i}$، i=1,...,k, where F is an absolutely continuous distribution function and $alpha_i>0$ Also, suppose that these samples are independent. Let Mi,ni and M'i,ni respectively, denote the maximum and minimum of the ith sa...
متن کاملStatistical Topology Using the Nonparametric Density Estimation and Bootstrap Algorithm
This paper presents approximate confidence intervals for each function of parameters in a Banach space based on a bootstrap algorithm. We apply kernel density approach to estimate the persistence landscape. In addition, we evaluate the quality distribution function estimator of random variables using integrated mean square error (IMSE). The results of simulation studies show a significant impro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Discrete & Computational Geometry
دوره 39 شماره
صفحات -
تاریخ انتشار 2008